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On  the  Belatlonshlp  Between  the  R end  S Arrays  and  the 
Bo^Jenkins  Method  of  ABMA  Model  Identification 


Wayne  A.  Woodward  and  H.  L.  Gray 
Southern  Methodist  Wnlverslty 


1.  INTBODDCTIOH 


Grmy,  Kelley,  and  Mclntlre  (1978)  have  proposed  an  alterna- 
tive to  the  Bo»- Jenkins  approach  of  ARMA  (p,q)  model  identification 


based  on  Rr  and  S-arrsya.  Their  method  was  shown  to  perform  well 


in  practice,  and  it  uniquely  determines  p and  q when  the  true  auto- 


correlation function  is  known.  Using  the  Box-Jenkina  approach 


however,  when  the  autocorrelation  function  is  known,  unique  deter- 


mination of  p and  q is  only  assured  when  either  p > 0 or  q ■ 0 


latlon  function,  a natural  extension  of  the  partial  autocorrelation 


function,  is  discusaed.  It  will  be  shown  that  more  information  con- 


0.  It  will  also  be  shown 


that  the  generalised  partial  autocorrelation  function  can  be  obtained 
as  a ratio  of  elaaMnts  in  the  S-array,  and  that  some  significant  infor- 
mation in  the  S-array  can  hm  lost  when  the  ratio  is  taken. 


p t-p 


- e 

q t-q 


(2.1) 


where  k • 1,  ....  p and  6^,  J ■ 1,  ....  q are  real  valued  con- 
stants with  d ^ 0 and  6 0 and  where  a Is  white  noise.  Employing 

p q t 

the  backward  ahift  operator  B,  defined  by  BX^-  X^^^,  equation  (2.1) 

is  often  written  in  the  form  *(B)X^  - e(B)a^  where 

♦(B)  - 1 - ♦-B  - ...  - ♦ bP 
i P 

e(B)  - 1 - e.B  - ...  - e B**. 

* 

It  will  be  assumed  that  the  two  equations 


♦(r)  -i-*r-  ...  -♦!?  -0 
i P 

and 


e(r)  - 1 - e.r  - ...  - e r^  - 0 
1 q 

have  no  common  roots. 


Letting  denote  the  autocorrelation  function  at  lag  k,  it  be 
shown  that  if  X is  a stationary  ARMA.(p,q)  process,  then 


From  (2.3)  we  obtain  the  well-known  Tule-Walker  equations,  l.c 


Basic  to  the  Box-Jenklns  approach  of  ARMA  model  Identification  is 
the  partial  autocorrelation  function  given  by 
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Pk-2 

Pk-3 


..  P, 


^k-2  ‘’k-1 


W-3  ^k-2 


(2.5) 


*kk“ 


'’k-1  ^2  V3 


Pi- 


Vl  ‘’k-2  ‘’k-3 


That  is,  if  It  la  asstaad  that  the  process  actually  is  ABMA  (k,0), 
xagardlasa  what  the  modal  actually  is,  then  is  the  Cramer's  Rule  solu- 
tion of  tho  first  k Tttla-Valkar  equations  in  (2.4),  with  k ■ p,  for  the 
last  autoragrasslva  eoefficlant.  The  Box- Jenkins  procedure  uses 
the  fact  that  if  Z(t)  actually  isARMA(p,0)  then  4^^  is  nonsero  for 
k • p and  identically  zero  for  k > p.  Also  utilised  in  the  Box- 
Jenkins  proeadura  is  the  fact  that  if  the  proeaas  is  ARMA  (0,q)  then 
p^  • 0,  k > q.  That  proeadura  inyolvas  tbs  inspaetion  of  graphs  of 
tha  autoeorralation  fumetloa  smd  partial  autoeorralation  functions, 
loaavar,  shea  p and  q arc  both  graatar  than  saro,  tha  autocorrelation 
and  partial  autoeorralation  fmetion  do  not  possass  graphs  uhieh 
unlqusly  datarmiaa  p and  q by  simpla  inspaetion. 
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Upon  «3caminatlon  of  the  Tule-Walker  Equations  in  (2,4)  It  can 
be  seen  that  If  X(t)  is  ABMA  (p>q)  then  the  autocorrelation  function 
does  not  satisfy  the  first  q Tule-Walker  equations  but  does  satisfy 
equations  q -t-  1 and  following.  Using  this  observation  we  can  define 
the  generalised  partial  autocorrelation  (6PAC)  function  aa 


la  the  eraser's  Buie  solution  for  the  last  autoregressive 


coefficient  using  the  (j  'f  1)  throu^  (J  k)  Tule  Walker 


equations  In  (2.4)  with  p ■ k.  That  Is,  A 


It  Is  easily  seen  that  If  is  actually  ABMA  (p,q)  then  - 4^. 
Also,  if  X la  ABMA  (p,q)  then  4^^^  is  nonsero  for  k ■ p and  Identic 


cally  sero  for  k > p.  In  addition  If  the  process  Is  ABMA  (p,q)  one 


would  norsally  use  equations  q ■t'  1 through  q + p to  solve  for  the 


parasaters.  Bowever,  one  could  utilise  the  p eqiiatlons  q + 1 through 


and  obtain  the 


solution  for  the 


ThM*  properties 


Bake  It  possible  to  uniquely  identify  p and  q of  a Bind  process  by 


slaple  Inspection  of  s table  If  the  true  eutocorreletlon  function  Is 


known.  Granger  and  Hawbold  (1977)  note  Chet  Jenkins  has  also 


suggested  the  use  of  4. 


Two  Bethods  of  s< 


poses  of  Identifying  p and  q will  be  discussed.  A first  presentation 


Is  the  array  given  in  Table  1 


TABLE  1 


Order 


if  la  an  ARMA  (p,q)  process,  then  the  associated  array  will  have 
the  pattern  shown  In  Table  2. 
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TABLE  2 


Hovlng 

Avarag* 

Ordar 


q-1 

q 

q+i 


6PAC  Array  for  an  ABMA  (p«q}  Proeaas 
Autoragrasalva  Order 
1 p-1  p P+1 


P+2 


7^ 


*11  • 

.(q) 

'U  * 

.(q+1) 


lo5 755 755 

^P“l»P-l  pP  ^P+1*P+1  ’p+2.P+2 


,(q-l) 

'p-l,p-l  ’pp 

(q) 

p-i.p-i 


(q-l)  .(q-1)  .(q-1) 

^p+l,p+l  ^p+2,p+2 


.(q+1)  . 

♦p-l.P-1  ♦ 


u ■ undaflnad 


Thus  the  proeadura  is  to  saarch  for  a colon  p In  which  constant 
behavior  occurs  and  a row  q In  which  the  alanants  are  zero  for 
coluans  k,  k > p.  An  altamatlva  to  this  approach  la  a graphical 
procedure  In  which  each  row  of  Table  2 is  graphed  on  the  sane  set  of 
anas  and  the  pattazas  of  Table  2 are  viewed  graphically.  This 
approach  la  slallar  to  the  Boi^- Jenkins  gr^hlcal  approach. 

Gray,  Kalley,  and  Mclntlre  (1978)  approach  the  problem  of 
Identifying  p and  q by  defining  R and  S array  elements  as  the  followir^ 


ratios 


where 


Melntlre  procedure  depend  ere  eiaaerlsed  in  Theoren  1. 

Theorem  1.  Let  be  e etetlonery  AKHA(Pfq)  proceee  (p  > 0)  with 
eutocorreletlon  p-,  Suppoee  S (p  ) Is  defined  end  noncero  for  ell  m. 


Proof;  Proofs  for  (s)  and  (b)  are  given  by  Gray,  Kelley,  and  McXntlre 
(1978)  and  In  sore  detail  by  McXntlre  (1977) . The  proof  of  (c)  follows 
aasUy  fron  (2.7). 

The  S values  are  placed  In  an  S-«rray  as  In  Table  3 where  we  have 
employed  the  shortened  notation  S (m)  ■ S (f  ). 
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In  Table  A we  present  the  behavior  of  the  S-array  when  is 


AItMA(p,q)  and  £ ■ P • 

n n 


TABLE  4 


(S(p  )) 

In 


p+l  p+2 


Mote  constant  stretches 


The  following  theorem  establishes  the  relationship  between  th«.  S- 
arrsys  end  the  generalized  partial  autocorrelation  function* 
Theorem  2.  Let  be  the  autocorrelation  function  of  a stationary 
tiaa  series.  Then 

♦kk^  - - SjjCp_j^+l)/S^(p_l^.j)  • 


-£+l 

e 

e 

e 

-q-p-2 

-q-p-1 

-q-p 

s 

e 

e 

-Kl-p 

q-p+1 

e 

e 

e 

J 


S,(-i+l) 


Sj^  (-q-p-2) 
Sj^  (-q-p-1) 
S,(-q“P) 


fq  non- 
constants 


fl2q  non- 
2q  non-  iconstants 
constants  ^ 


Sj^(q“P) 

S,(q-P+1) 


S,(J) 


Proof;  We  vUl  let  |a|  and  |b|  denote  the  niaerator  and  denoalnator 
deteniilnaui.k  respectively  In  the  definition  of  in  (2.6).  By 


Also,  It  can  be  shown  that 


where  [ ] denotM  the  greatest  Integer  function.  Thus 


As  a practical  point  It  should  be  noted  that  althou^  the 
eleaents  of  the  GPAC  array  could  be  calculated  via  their  defining 
relation  (2.6),  their  calculation  is  facilitated  using  Theoren  2 and 


Bafor*  procaadlng  further  it  should  b*  polntad  out  that  a 
result  siallar  to  that  of  Theorem  1 holds  whan  f * C*** 

m H 

[4]).  6ray»  Kelley,  and  Mclntlre  recommend  the  Inspection  of  the 
S-srrsys  both  with  f ■ emd  with  f ■ (“l)"p_  since  the  model 
Identification  behavior  In  tha  presence  of  noise  Is  often  clearer 
In  one  of  the  arrays  than  In  the  other,  yet  neither  Is  uniformly 
better  than  the  other.  It  does  appear  that  for  low  frequency  data  It  Is 
better  to  take  f - C-1)*P-  computing  Table  3 and  for  high  frequency  data 


S-arrays  are  usually  better  at  f * p where  p_  will  be  defined  In  Section  3 

B B B 

Theorem  3 establishes  the  relationship  between  the  generalized 


partial  autocorrelation*  function  and  the  S-array  with  f^  ■ (-1)  p^ 
Theorem  3t  Let  p.  be  the  autocorrelation  function  of  a stationary 


Proof;  It  Is  easily  shown,  that  when  f ■ (**1)  p 


where  |a*|  and  |B*|  are  defined  as  were  |a|  and  |b|  in  the  proof  of 
Theorem  2 with  p_  replaced  by  (-l)*p..  Through  row  and  coliam 
operations  on  |A*|  and  |B*|,  It  can  be  shown  that  |A*i/|B*|  ■ 
(-1)^|a|/|b|  and  the  result  follows. 

Theorem  3 shows  that  the  CPAC  array  can  be  ealeolated 
from  the  S-array  at  f “ p or  f • (-l)*p_. 
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3.  MODEL  USMTIFICAIION  DSI1I6  THE  6PAC  ARSAT- 
ULDSTEATITE  EXAMPLES 


In  thin  Mctlon  wc  will  Uluatrstc  the  u*e  of  the  CPAC  errey 
through  three  exaaples.  In  these  ezawplee  the  autocorrelation 
function  will  be  estimated  baaed  upon  a realization  of  length  T 
by 


* •} 
I 

t-0  ' 


It  will  be  seen  that  the  patterns  In  the  6PAC  array,  although  somswfaat 
disturbed,  are  still  very  useful  In  Identifying  the  order  of  the  model. 
Bzanple  3 Indicates  a caution  which  must  be  exercised  In  the  use  of 
the  6PAC  array  for  determining  the  moving  average  order. 

Finally,  before  proceeding  to  the  examples  It  should  be  mentioned 
that  If  p ■ 0,  the  Bo»-Jenklns  procedure,  the  Gray,  Kall^,  Mclntlre 
procedure,  and  the  6PAC  procedure  all  must  utilize  the  autocorrelation 
function  with  Its  property  that  ■ 0,  k > q + 1. 

Example  1.  Consider  the  process 

X^  - 1.3A  X^_j^  + .65  X^_2  - a^.  (3.1) 

In  Table  5 the  GPAC  array  using  the  true  ..utocorrelatlon  function 
is  presented.  The  Identification  as  an  ARMA  (2,0)  process  is  clear 
doe  to  the  fact  that  6^  24^  * •••  khat  622^  * -.65, 

1 ■ 0,  1,  2,  ...  . In  Table  6 the  sample  generalized  partial 
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autocorrelation  function  array  ia  given  baaed  on  a realization  froa 
(3.1)  of  length  100.  The  pattema  which  occurred  exactly  in  Table  5 
are  atill  viaible  in  Table  6»  i.e. , 1 241 1 - *127,  i ■ 1*  ...  8 
and  $22^  % -.65,  i • 0,  ...  5.  Of  courae  the  proceaa  ia  atrietly 
autoregreasive  and  it  ia  clear  that  one  could  have  identified  the 
proceaa  aa  ABU  (2,0)  atrietly  on  the  baaia  of  the  flrat  row  of 
Table  6 which  is  the  partial  autocorrelation  function.  However,  aa 
will  ueually  be  the  caae,  the  conatant  behavior,  which  occurred  in 
the  aecond  coliavi  la  thia  exaaple,  ia  the  noat  viaible  pattern.  In 
maj  event  the  coihleatlon  of  both  ia  clearly  sore  infomatlve  than 
either  alone.  In  Exaaple  3 we  will  deal  sore  conpletely  with  the 
coaipariaon  of  the  nodal  identification  capabilitlea  of  the  conatant 
coluan  behavior  and  the  zero  row  behavior  in  the  GPAC  array. 

TABLE  5 

True  Generalized  Partial  Autocorrelation 
Function  Array  for  Serlea  (3.1) 

Autoregreaaive  Order 


1 

2 

3 

4 

5 

6 

7 

8 

0 

.812 

-.650 

.000 

.000 

.000 

.000 

.000 

.000 

1 

.540 

-.650 

u* 

u 

u 

u 

u 

u 

Moving 

2 

.135 

-.650 

u 

u 

u 

u 

u 

u 

Average 

3 

-3.458 

-.650 

u 

u 

u 

u 

u 

u 

Order 

4 

1.528 

-.650 

u 

u 

u 

u 

u 

u 

5 

.915 

-.650 

u 

u 

u 

u 

u 

u 

u ■ undefined 


TABLE  6 

Saaple  Generalized  Partial  Autocorrelation  Function  Array 
for  a Heallzation  of  Length  100  from  Seriea  (3.1) 

Autoregreaalve  Order 


.821  -.650  .029  -.114  -.075  -.127  -.034 

.562  -.628  -2.487  -.133  .116  -.107  -.370 


Moving  2 .209  -.702  .392  .232  -.196  -.170  -.160 

Average  3 -1.894  -.715  .873  1.143  -.738  -.278  -.167 


.232  -.196  -.170  -.160 


Order 


292  -1.692  -. 


Rxanple  2.  Consider  the  process 

- 1.5  X^_j^  + 1.21  X^_2  “ *^55  X^_3  " *t  •^‘t-l  * •®*t-2* 

In  Tables  7 and  8 are  given  the  GPAC  arrays  based  on  theoretical 
autocorrelations  and  sample  autocorrelations  from  a realization  of 
size  300  respectively.  In  Table  7 the  fact  that  ^ ■ 0^  i " 

1,  2,  ...  and  that  4^3^  - 433''^^  - .455,  i - 1,  2,  ...  indicates  that 
p ■ 3 and  q " 2.  These  patterns  again  are  also  quite  clear  in  Table  8 


TABLE  7 


Moving 

Average 

Order 


True  Genarallzed  Partial  Autocorrelation 

Function  Array  for  Series  0.2) 
Autoregressive  Order 
5 


3 .328  2.073 

4 1.356  -.119 

5 1.632  5.367 

*u  ■ undefined 


5 .062 

9 .848 


. 


TABLE  8 


0 .819  -.725  .344  .315  -.164  -.177  .145  .105 

1 .528  -.540  .930  .477  -.494  -.307  .270  .271 

Moving  2 .165  -.309  .458  -.017  .101  .014  .004  .037 

Average  3 -1.124  -.204  .452  2.850  .104  -.017  -.123  .045 

Order  4 .296  -1.048  .468  .024  .092  -1.702  -.316  .072 

5 -4.174  -1.596  .467  -1.771  .145  .005  .294  .393 


3.  In  Table  9 la  preaenced  the  GPAC  array  using  the  true 


In  that  array  It  ahould  be  noted  that 


Since  the  process  Is  ABMA  (2,1)  we  would  have  expected  ■ - .75 
J > 1 and  ^ - .75.  This  exanple,  however,  points  out  that  tha 


constant  behavior  in  the  p coltan  of  the  GPAC  array  of  a sta- 


tionary ABMA  (p,q)  procass  aay  bagln  prior  to  row  q.  In  ether  words 
If  the  procass  actually  is  ABMA  (p,q),  with  4^  the  p^^  autore- 
gressive coefficient,  and  it  la  treated  as  an  ABMA  (p,q-l),  for 
axaapla,  tha  aatiaata  of  tha  p^  autoregressive  coefficient  in  this 
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using  the  GPAC  array  to  deteznlne  q.  The  constant  behavior  In  a 
column  of  the  GPAC  array  should  be  accoaq>anled  by  the  appropriate 
aero  behavior.  It  la  clear  that  for  a stationary  ARMA  process, 

■ 0 for  k > p + 1 and  1*  0 If  and  only  If  the  process  la 
ARMA  (p,q).  Thus,  In  the  array  In  Table  9,  although  the  constant 
behavior  la  misleading,  the  aero  behavior  correctly  Identifies  p and 
q.  We  will  formally  state  the  results  of  the  above  discussion  In 
tha  following  theorem. 

Theorem  4.  Let  be  a stationary  ARMA(p,q)  process  (p  >0). 

(•)  - 4*  . j > q* 

pp  p - 

(b)  Suppose  that  the  nth  order  Yule-Walker  equations  are  non- 

slngxilar.  For  some  constant  m,  6^®^  - 0,  k > n+1  and  6^®^  T*  0 

Kk  — nn 

Iff  n ■ p and  m ■ q. 

Referring  back  to  Theorem  1 and  the  corresponding  results  for 
Ic 

f|^  " (>1)  P|^  (see  14])  It  la  seen  that  the  conatant  behavior  in  the 
S-array  la  necessary  and  sufficient.  In  Table  10  la  presented  tha  2®^ 
coluan  of  the  S-array  with  fj^  * P|^  from  which  the  GPAC  array  In  Table  9 
was  calculated.  We  see  that,  aa  would  be  eiepeeted  from  Theorem  1,  the 
aablgulty  seen  In  Table  9 Is  not  present  In  Table  10.  The  additional 
conatant  term  lu  Table  9 was  due  to  the  fact  that  1.749/2.333  ■ 
1.360/2.080  t'j  3 decimal  places.  It  la  obvious  that  sequences  of  con- 
stants in  the  OAC  array  doe  to  constant  ratios  with  numsratore  and 
denominators  which  vary,  can  In  fact  occur  In  any  column  of  the  GPAC 
array.  Thus,  tha  S-array  should  always  be  checked  whan  considering  con- 
stant behavior  in  the  QPAC  array.  In  Section  5 we  will  demonstrate  that 
the  sero  behavior  of  GPAC  can  also  be  misleading.  This  problem  la  also 
alleviated  via  the  S-array  as  w«  will  aee. 


rMBMlaatlon  of  th«  S-arr«y.  For  a atationary  ABHA  (p,q)  proeaaa, 
tha  eoBataat  bahavlor  in  tha  p^^  colunn  of  the  GPAC  array  ralataa 
eloaaly  to  tha  eonatanta  and  C2  in  tha  p^^  coluan  of  tha  5-array 
Tha  "saro"  bahavlor  in  tha  CSPAC  array  In  turn  corraaponda  to  Thaorai 
1(c).  That  la  If  la  ARMA  (p,q)  than  for  1 > 1,  2»  ...  wa  have 

- «-l)\  -Wl*  - 

“ ®p+l^‘’-p-14q+l^^®p+l<P-p-i-q^  ■ 0*  Kallay,  and  Mclntira 

(1978)  raeoanand  utilizing  the  Infomatlon  In  roliann  p 4-  1 in  tha 


identification  of  an  ARMA  (p^q)  procaaa,  but  they  do  not  auggaat 
utilizing  the  behavior  daacrlbad  In  Thaoren  1(e)  for  1 > 1.  In  fact 
tha  D-atatlatlc  (aea  [4])  utillzaa  eoluana  p and  p 4-  1 only  (In 
addition  to  column  p 4'  1 of  tha  I^array).  Thua  tha  S-array  approach 
uaaa  tha  conatant  bahavlor  and  tha  flrat  zero  In  tha  row  bahavlor  of 


tha  GPAC.  It  aaeaa  that  ona  la  In  fact  Ignoring  uaaful  modal  Idantl* 
flcatlon  Information  by  not  conaldarlng  tha  bahavlor  In  tha  columna 
paat  tha  p 4-  1 **  columna  of  the  S-array.  Tha  authora  are  currently 
conaldarlng  a modification  of  tha  D-atatlatlc  with  thaaa  connanta  in 


Tha  S-array,  howavar,  la  not  praaantly  daalgnad  to  aaally 
faelUtata  auch  an  aril  nation  of  tha  Thaoram  1(c)  bahavlor  aa  ona 
muat  procaad  diagonally  throu^  tha  array  to  aramina  thla  bahavlor. 
To  allavlata  thla  problam,  a modification  of  the  S-array,  to  ba 
called  tha  Shifted  S-array  will  ba  Introduced.  Thla  Shifted  S-array 


was  first  proposed  by  G.  Kelley  in  1977.  Letting  denote  the 

element  In  the  column  at  lag  J of  the  Shifted  S-array,  we  have 


Thus,  Theorem  1 can  be  restated  as  follows 


Theorem  1 * ; Let  X be  a stationary  ABMA(p,q)  process  (p  > 0)  with 


autocorrelation  p . Suppose  that  S*(p  ) Is  defined  and  nonxero  for  all  m 
m n m 

(a)  For  some  Integer  m^  and  som  constant  ^ 0, 

S*(p_)  - C, , m > m- 


Thus,  if  is  a stationary  AKMA  (p,q)  process,  then  in  colians 
p 4 1,  1 • 1,  2,  ...  the  value  ± • will  occur  at  lag  -q-1  and 
(-l)*Cj  wiU  occur  at  lag  q.  In  addition  to  sii^>Ufylng  the  Theorem 
1(c)  behevlor.  Theorem  l'  end  Table  11  indicate  that  the  constaat 
behavior  will  also  be  simplified.  In  the  Shifted  S-err^  for  a 
stationary  AIMd  (p,q)  process  the  2q  nonrconatant  terms  will  always 
be  the  elements  from  -q  through  q>l  regardless  of  p.  That  is,  the 
nonconstsnt  behavior  will  always  be  centered  around  lags  >1  and  0. 
This  will  simplify  the  use  of  the  Infemetion  in  the  S-array  and  will 

eliminate  the  need  for  the  "starred  quantities"  to  assist  in  locating 
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the  correct  position  for  inspection  of  a particular  column.  In  Table 
12  is  presented  the  theoretical  Shifted  S-array  for  the  AKHA  (3,2) 
process  of  Exasiple  2 while  in  Table  13  is  presented  the  Shifted 
S-array  for  the  realization  of  the  length  300  upon  which  the  GPAC 
array  in  Table  8 is  based.  From  Table  12,  the  ± •>  behavior  occurs 
at  lag  -^<-1  " -3  beginning  in  eoliani  4.  In  Table  13  this  behavior 
is  manifested  in  the  "large"  numbers  268.053,  -42.435,  -319.257.  At 
lag  q ■ 2 in  columis  4,  5,  and  6 the  quantities  4.494,  -4.300,  and 
4.454  respectively  correspond  to  the  (-1)^0^^  behavior  of  Theorem 
1(c) . These  observations  along  with  the  two  sets  of  near  constants 
in  column  3 of  Table  13  Identify  the  process  as  an  ARMA  (3,2).  Of 
course  the  nonconstant  behavior  is  centered  at  the  line  drawn  between 
lags  0 and  -1.  It  is  the  opinion  of  the  authors  that  the  Shifted 

t 

S-array  presents  the  information  in  the  S-array  in  a format  which  is 
easier  to  use  in  practice. 

5.  A CQMPBEHENSIVE  EXAMPLE 

In  this  section  we  employ  the  model  identification  techniques 
mentioned  in  this  paper,  naswly  the  GPAC  array  and  the  Shifted  S-array 
to  model  the  Makrldakis  (1978)  metal  series  data.  It  is  hoped  that 
this  ezsmple  will  demonstrate  the  use  of  these  techniques  in  modeling, 
rather  easily,  a series  which  has  been  difficult  to  handle  with 
previous  techniques. 

The  metal  series  data  consists  of  144  monthly  values  of  carbon 
steel  monthly  shipments  from  1961-1972.  Makrldakis  has  fit  the  model 
(1  - B)  - a - e^B)(l  - ej^B^^)  a^,  (5.1) 

where  - 3082  and  X^  is  the  data.  He  reports  that  this  model 
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TABLE  12 


Shifted  S-Arrey  for  Hodel  (3.2) 


n 

^5 

S*4 

»! 

H 

-8 

-2.872 

-7 

-2.037 

3.552 

“6 

-1.613 

1.247 

-9.154 

-5 

-1.737 

23.860 

-9.154 

u 

-4 

-4.052 

2.915 

-9.154 

u 

u 

-3 

-3.556 

22.437 

-9.154 

±» 

±« 

±« 

-2 

-2.651 

5.686 

-7.573 

1.470 

6.806 

-7.9U 

-1 

-2.184 

4.456 

-10.750 

-10.450 

7.148 

-65.674 

0 

-1.845 

3.148 

-4.452 

3.122 

-2.173 

2.489 

1 

-1.606 

2.606 

-6.334 

-1.004 

-3.527 

-1.090 

2 

-1.391 

1.578 

-4.165 

4.165 

-4.165 

4.165 

3 

-1.328 

-6.044 

-4.165 

u 

u 

4 

-2.356 

2.838 

-4.165 

u 

5 

-2.632 

-6.691 

-4.165 

6 

-1.964 

3.598 

7 

-1.534 

TABLE  13 

Shifted  S-Aarray  for 

Keallzetlon 

of  Length  300  fram  Hodel  (3.2) 

H 

H 

n 

-8 

-2.312 

-7 

-1.628 

3.467 

-6 

-.760 

2.386 

-9.505 

-5 

-4.375 

1.347 

-9.472 

-177.715 

-4 

-.110 

6.459 

-9.579 

8.022 

-48.460 

-3 

-7.064 

6.664 

-9.543 

268.053 

-42.435 

-319.257 

-2 

-2.893 

5.049 

-7.264 

-1.905 

10.998 

2.990 

-1 

-2.221 

4.329 

-12.256 

-9.160 

14.689 

16.074 

0 

-1.819 

3.138 

-4.218 

2.888 

-2.414 

2.840 

1 

-1.528 

2.728 

-6.752 

.908 

-5.435 

.919 

2 

-1.165 

2.059 

-4.372 

4.494 

-4.300 

4.454 

3 

.124 

1.319 

-4.334 

-22.865 

-5.046 

4 

-1.296 

1.412 

-4.438 

4.211 

5 

3.174 

3.807 

-4.435 

6 

-2.593 

3.322 

7 

-1.762 

m 
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does  not  forecast  as  veil  as  the  randoa  walk  mo.  .1,  an  apparent 
paradox  since  he  believes  this  to  be  the  optimum  model.  Farzen  (1979) 
also  analyzes  the  data  and  suggests  the  solution  to  the  paradox  is 
that  the  model  of  Makrldakas  is  not  satisfactory.  Farzen  identifies 
the  AR  (2)  process 


^t  ” ^t~l  ” ^t“2  " *t 


(5.2) 


as  the  best  choice  according  to  Farzen* s (1974)  CAT  (criterion  auto- 
regressive transfer)  criterion.  The  same  result  is  obtained  using 
Akalke's  (1974)  FFE  criterion.  However,  Farzen  notices  that  the 
sample  spectrum  does  not  agree  very  well  with  the  spectrum  of  the 
AR  (2)  model  (5.2)  end  that  the  second  choice  according  to  CAT  is  an 
AR  (13)  (The  second  choice  by  FFE  is  AR  (3).).  Moreover  the  spectrum 
obtained  from  the  fitted  AR  (13)  is  quite  compatible  with  the  sample 
spectrum.  For  this  reason  Farzen  prefers  the  AR  (13)  model  even  though 
it  is  not  selected  as  best  by  either  CAT  or  FFE. 

A plot  of  the  metals  data  is  given  in  Figure  1 and  the  sa^>le 
autocorrelations  are  shown  in  Figure  2.  Table  14  shows  portions  of 
the  shift 'id  S-array  for  the  data  with  f - (-D'^P  . The  array  la  evaluated 

■ m 

with  f ■ (-l)‘p  because  the  data  is  clearly  predominantly  low  frequency 
m m 

(see  Gray,  Kelley,  Meintire  (1978)).  For  that  array  the  D-statlstlc 
suggests  that  the  array  be  examined  for  the  poaslblllty  of  an 
ABMA  (1,1),  ARMA  (1,2),  ARMA  a>6)  or  an  ARMA  (13,1)  or  AR  (13). 

Examination  of  the  shifted  S-array  immediately  suggests  from  columns 
1 and  13  that  the  process  is  ARMA  (13,1)  with  a first  order  factor  of 
approximately  (1  - .9B).  Moreover  since  the  two  nonconstant  values 
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In  colon  13  nr*  cion*  to  tho  "constants"  of  column  13  It  Is  clear 
that  the  moving  average  term  la  not  large.  In  keeping  with  the  pro- 
cedure of  Gray,  Kelley,  and  Kclntlre,  the  data  was  transformed  by 
estimating  the  coefficient  of  1 in  (1  #B)  by 

8,(1)  + Sj(2)  + S (3) 

4 . -i % .9  . 

Si(-2)  + S^(-3)  + S^<-4) 


TABLE  14 


Portion  of  Shifted  S-Arrey  for 
Makrldakls  Metals  Series 


Sf 

^3 

®f4 

ns 

-5  -2.240 

.091 

-.399 

-3.079 

2.143 

-5.921 

4.662 

-4  -2.134 

.406 

-1.313 

1.780 

2.755 

-.400 

2.006 

-3  -2.206 

-3.458 

3.731 

-4.595 

3.020 

-2.152 

1.174 

-2  -2.048 

-4.468 

18.637 

.969 

3.263 

-17.066 

1.676 

-1  -2.493 

-3.168 

-30.088 

. . . -3.951 

3.008 

8.151 

10*565 

0 -1.670 

1.093 

-1.135 

.815 

-.641 

.696 

-.653 

1 -1.954 

1.254 

-1.511 

-.201 

-.953 

1.156 

-.197 

2 -1.829 

1.149 

5.814 

1.236 

-.935 

-1.811 

-.134 

3 -1.882 

-.424 

.117 

-.488 

-.731 

-.m 

-1.535 

4 -1.807 

-.081 

-4.636 

1.203 

-.657 

1.229 

-1.344 
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It  is  of  eourss  aors  convsntionsl  to  dlffsrsnes  ths  data  Ic  such 
clrcuBStancas  as  this.  Bowavar,  thi>  data  claarly  doas  not  indicata 
a strong  tandancy  toward  nonatationarity  but  doas  suggast  ths  possi- 
bility of  a root  naar  C.9)'^  in  tha  eharaetariatic  aquation.  Sinca 
as  has  baan  dawonstratad  by  Gr^r,  Kallay,  and  Mclntira  (1978),  a 
factor  of  tha  fora  (1  - .9B)  is  of tan  closa  anou^  to  tha  non- 
stationary ragion  to  produca  a naarly  singular  autocorrelation 
function,  tha  S-array  of  (1  - .9B)X^  should  ba  asaainad  as  confiraa- 
tion  of  tha  tantatiaa  idantification  of  tha  process  as  sn  ARKA  (13,1). 
Althou^  in  aost  problaas  it  ashes  little  diffaranca  at  this  stags  of 


tha  idantification  process  whathar  one  transfoias  tha  data  by  tha 
difference  operator  1 - B or  tha  operator  1 - .9B,  these  authors  have 
found  soaa  cases  where  this  sassdng  saall  change  in  tha  operators  esn 
have  a large  affect.  Tha  shifted  S-array  of  (1  - .9B)X^  is  shown  in 
Tsbla  15.  Tha  suggested  aodal  there  is  clearly  an  ABMA  (12,1)  and 
tha  aowing  average  effect  is  obviously  not  large.  This  is  consistent 
with  out  initial  identification  of  an  ARKA  (13,1).  Thus  tha  Identified 
■odal  is 


♦(B)X^  - (1  - e^B)a^ 


(5.3) 


irtMta 

♦ (B)  - 1 - - BjB^  - ...  - ijjp  . 

Bstiaation  of  these  coefficients  shows  only  4^^,  4^^,  4]j  sad  6^  to 
bo  significantly  different  frow  sero  and  hence  suggests  the  parslao- 
nlous  nodal 


si:*-'-  ' " 


'■  'fy 


i 

» 

I 

t 

' TABLE  15 


Portion  of  Shlftnd  S-Array  for  ■ Cl-.9B}X^ 
wliara  is  the  Makridakis  Katals  Series 

Cf,  - (-1)\) 


H 

-5 

-.187 

.252 

.657 

2.007 

-1.051 

3.166 

-1.475 

1.989 

-A 

-.264 

3.037 

1.534 

-3.219 

-1.239 

-.300 

-1.746 

-2.014 

-3 

1.298 

-1.789 

.410 

4.449 

-1.437 

1.504 

-1.848 

-6.442 

-2 

2.560 

-7.586 

-.570 

-1.381 

-1.422 

-43.835 

.258 

4.163 

-1 

1.188 

5.875 

U.809  ... 

2.056 

-1.181 

-2.534 

-4.810 

4.054 

0 

-.543 

.598 

-.569 

-.397 

.297 

-.337 

.315 

-.292 

1 

-.719 

.853 

.092 

.220 

.494 

-.480 

-.054 

-.350 

2 

-.565 

-1.311 

-.075 

-.973 

.496 

-9.085 

.372 

-.387 

3 

.359 

-.252 

-1.514 

1.040 

.401 

-.069 

.498 

-.260 

4 

.230 

4.512 

-.802 

-1.444 

.356 

-.584 

.485 

-3.975 

C5.4) 


(1  - 48)  (1  - - a - eB)a^. 

Initial  aatiaatea  for  ^ 6 are 

♦ - .9,  ♦ - .35»  e - .46  . 

Tha  apactral  danalty  of  tha  ARM4  (13,1)  nodal  fittad  from  (5.3)  la  ahown  in 
Figure  3.  Thia  la  aaaentlally  tha  aaM  apactral  danalty  aa  that  found  by 
Parsan  and  aa  aantionad  ia  quite  conalatent  with  tha  aaapla  apactral  danalty. 

Thua  far  wa  have  not  conaidarad  tha  ganarallzed  partial  auto- 
correlation In  thla  esaapla.  Thla  la  to  aoaa  axtant  due  to  tha  fact 
that  aa  aantionad  before,  Ita  coluan  behavior  la  not  aufflclent  to 
eharactarlza  an  ABMl  (p,q)  procaaa,  even  thoti^  ita  row  behavior  doaa 
charactarlza  auch  a procaaa.  Navarthaleaa  tha  ganarallaad  partial 
autocorrelation  la  quite  uaaful  and  thaaa  authora  alwaya  conalder  the 
GPAC  array  aa  wall  aa  tha  S-array.  In  thla  way  there  la  no  danger  In 
alaconatrulng  tha  conatant  colom  behavior  found  In  tha  GPAC  array. 

In  thla  azanpla  the  GPAC  praaanta  tha  aodal  Identification  quite  nicely 
and  at  tha  aaaa  tlaa  ahowa  why  tha  aero  behavior  In  tha  rowa  ahould  not 
be  uaad  alone  for  identification,  even  for  A8  (p)  nodela  only.  Table  16 
ahowa  tha  GPAC  for  tha  aatala  data  and  Tabla  17  for  (1-.98)JC^.  Bow  1 
of  Tabla  16  claarly  danonatrataa  why  CAT  and  FPB  choae  an  AR  (2)  aodal. 

That  ia,  alnca  through  are  all  approzlaately  aero  It  la  not 

attrprlalng  that  CAT  and  IPI  pick  aa  AR  (2).  On  tha  othar  hand  nota  that 
coltBB  2 In  tha  GPAC  la  not  avaa  approslaataly  conatant  and  hanca  tha 
long  atrlng  of  aarea  la  row  1 la  only  auggoative  of  a nunber  of  aaro 
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co«££lcl«nts  prior  to  That  is,  the  row  and  column  bahavlor 
togathar  suggest  tha  procasa  la  not  AK  (2)  but  does  have  a nui^r  o£ 
saro  eoaf£lclants  £ollowlng  ^2*  claarly  aaan  In 

tha  shl£tad  S-array  wbara  eolun  2 showa  no  constancy  bahavlor.  In 
that  array  tha  aaro  bahavlor*  aaan  la  row  1 o£  tha  QPAC  array*  la 
obtalnad  £ron  tha  ratio  o£  tha  alamants  S^C-l)  and  S|^(0}*  l.a.  thoaa 
on  aach  aids  o£  the  eantar  Una.  Thus  tha  S-array  or  ahl£tad  S-array 
and  6PAC  usa  both  tha  row  and  column  behavior  for  Idantlllcatlon  of 
a process.  Tha  Information  gained  by  this  procedure  la  clearly 
damonatratad  In  this  example  where  one  can  easily  aaa  that  tha  con- 
stant column  bahavlor  la  Influenced  by  all  of  tha  coefficients  even 
If  a number  of  saroa  Ua  between  tha  first  and  tha  last  autoragrasslva 
coefficients.  However  tha  saro  bahavlor  of  rows  gives  no  Information 
as  to  tha  values  of  for  larger  values  of  k*  l.a.  thoaa  further  out 
than  tha  saroa.  Tha  ARKA  (13*1)  Identified  by  tha  shifted  S-arr^  la 
also  vividly  displayed  by  tha  6PAC  array.  Note  that  tha  ARKA  (13*1) 
la  tha  only  choice  which  shows  both  the  iroper  row  and  column  bahavlor. 
Navertholaas  tha  large  values  .954*  .826,  .882,  ate.  In  row  1 suggests* 
for  reasons  already  mentioned*  that  It  Is  a good  Idea  to  transform  tha 
data  to  more  stationary  bahavlor  (as  we  did)  before  making  tha  final 
Identification. 

In  concluding  this  example  wa  make  two  final  eommants.  Tha 
statistic  which  has  bean  recommended  [ 4 ] as  an  aid  in  directing 
the  Invaatlgator  to  saUant  patterns  In  tha  S-arrays  amaauraa  Jointly 
both  tha  constant  column  bahavlor  of  tha  S-array  and  tha  saro  row 


la  probably  ovarly  aanaitlva  to  palxwlaa  conatant  bahavior  and  tha 


■odlflcationa  of  thla  are  now  undarway  and  It  la  hopad  that  It  aay 
avantoally  avolva  aa  aora  than  a gulda  to  Inapactlon  of  tha  S-array 
but  aa  a dapandabla  aatlaator  of  p and  q for  tha  ABMA  (p«q)  procaaa. 
Finally  aa  a word  of  caution,  tha  6PAC  la  a very  uaeful  aaaaura^ 
howavar  Eraavle  3 which  denonatrataa  that  Ita  column  bahavior  la  not 
aufflclant  to  Identify  q la  not  pathological  and  theae  authora  have 


encountered  a number  of  real  data  aata  where  the  behavior  denonatrated 


In  Example  3 waa  obaarvad.  Whan  theae  obaervatlona  are  coupled  with 
tha  pxaaent  example  which,  daaonatratea  tha  aero  bahavior  alone  can  alao  be 


aialaadlng.  It  la  clear  that  tha  S-array  nuat  be  conaultad,  being 


the  only  meaaure  which  uniquely  characterlzaa  the  ABKA  (p,q)  by  both 


arriQra.  However,  believing  that  ahlftad  S-arraya  are  aaalar  to  "read' 


for  moat  uaera,  our  futura  rafaranca  to  S-arraya  will  naan  ahlftad 


In  thla  paper  we  have  denonatrated  tha  uaa  of  tha  ganarallzad 


Janklna  aethod.  In  addition  wa  have  denonatrated  tha  ralatlonahlp 


between  the  S-  and  R-array  approach  of  Gray,  Kelley  and  Mclntlra  and 


th«  gwMraliscd  partial  autoeorralatlon.  That  la,  wa  have  ahown  that 
tha  ganarallsad  partial  autoeorralatlon*  such  Ilka  tha  I>-atatlatlc, 
rapraaanta  a eondanaatlon  of  tha  InfoxMtlon  In  tha  S^array.  Howavar, 
It  la  alao  ahown  that  tha  Infotaatlon  In  tha  GPAC  array  can  at  tlaaa 
ba  mlalaadlng.  Wa  thua  racoaaand  tha  uaa  of  tha  GPAC  array  along 
vlth  tha  Rr  and  S-arraya  and  tha  D-atatlatlc.  Actually  wa  raco— and 
tha  uaa  of  a alight  nodlfleatlon  of  tha  S-array  which  wa  call  tha 
ahlftad  S-array  bacauaa  It  praaanta  tha  Information  In  tha  S-array  In 
a mora  aaally  Intarpratabla  fora. 
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In  this  paper  an  extension  of  the  partial  autocorralatlon  function  which  we 
call  the  generalised  partial  autocorrelation  function  la  investigated.  This 
generalised  partial  autocorrelation  function  la  useful  In  examining  the  re- 
lationship between  the  R-  and  S-array  method  of  Gray,  Kelley,  and  Melntlre 
and  the  Box-Jenklns  approach  to  ARMA  model  Identification.  Also  the  general- 
ised partial  autocorr^atlon  la  shown  to  be  a useful  model  Identification 
tool  to  be  used  along  with  the  R-  and  S-arrays.  Also  discussed  Is  a re- 
formatting of  the  S-array  Into  the  Shifted  S-a 
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